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APPARATUS AND METHOD FOR MONITORING STREAMED MULTIMEDIA 
QUALITY USING DIGITAL WATERMARK 

Technical Field of the Invention 
The present invention relates generally to streaming multimedia, and in 
particular, relates to a digital watermark used to monitor the quality of streamed multimedia 
over a network. 

Background of the Invention 
Digital watermarks are currently commonly used to authenticate content that 
has the watermark. The digital watermark is typically embedded into the content by means 
of digital signal processing. In the case of multimedia content, the watermark is usually 
imperceptible to the human eye/ear when the content is reproduced. In this manner, the 
authenticity of the content can be determined without the watermark producing undesirable 
side-effects. 

The digital watermark is designed to be robust to digital manipulation so that 
if the content is changed slightly, the watermark is persistent. In other words, unless the 
quality of the content is significantly degraded or changed, the watermark is still recoverable. 
The watermark allows the ownership to be determined up to a degree where the modified 
content is likely to be no longer useful. In this manner, digital reproduction of the content is 
discouraged. Examples of digital watermark techniques in the context of content 
authentication are described in U.S. Patent No. 5,915,027 to Cox et al. and U.S. Patent No. 
6,101,602 to Fridrich. 

While digital watermarking has been used in the context of authentication, as 
detailed below, other applications have arisen with the increase of networking. For example, 
the explosion of the Internet in the past decade has caused a corresponding explosion in the 
delivery of multimedia content over the Internet. Audio and video clips are now being 
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routinely distributed over the Internet. Live broadcasts of events are now starting to become 
commonplace. One concern is the maintenance of content quality as the multimedia content 
is distributed over the Internet. For various reasons, it has been found that as content is 
packetized and routed over the Internet, degradation occurs. The present invention relates to 
the monitoring of this degradation. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
A non-limiting and non-exhaustive embodiment of the present invention is 

described with reference to the following Figures. 

Figure 1 is a schematic diagram showing one embodiment of a system for 

implementing the present invention; and 

Figure 2 is a flow diagram illustrating the method of the present invention. 
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DETAILED DESCRIPTION 

Embodiments of an apparatus and method for power continuity testing in a 
parallel testing system are described herein. In the following description, some specific 
details and examples are provided, to provide a thorough understanding of embodiments of 
the invention. One skilled in the relevant art will recognize, however, that the invention can 
be practiced without one or more of the specific details, or with other methods, components, 
materials, etc. In other instances, well-known structures, materials, or operations are not 
shown or described in detail to avoid obscuring aspects of the invention. 

Reference throughout this specification to "one embodiment" or "an 
embodiment" means that a particular feature, structure, or characteristic described in 
connection with the embodiment is included in at least one embodiment of the present 
invention. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" 
in various places throughout this specification are not necessarily all referring to the same 
embodiment. Furthermore, the particular features, structures, or characteristics may be 
combined in any suitable manner in one or more embodiments. 

As an overview, an embodiment of the invention is implemented on a 
network, such as the Internet, to monitor the quality of distributed multimedia content. A 
digital watermark is embedded into the multimedia content. Various monitoring clients are 
disbursed over the network. The monitoring clients receive the multimedia content and 
examine the digital watermark. Statistical information relating to the degradation of the 
content is then gathered by an analysis of the watermark. 

Referring first to Figure 1, shown generally at 10 is a system according to an 
embodiment of the invention. A network 12, such as the Internet, can implement the system 
10, but other types of communication networks may be utilized as well. For example, the 
network 12 can comprise a local area network (LAN), virtual local area network (VLAN), 
asynchronous transfer mode (ATM) network, or other network or portion of a network. 
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The system 10 includes one or more network servers 14 communicatively 
coupled to one or more terminals 16 via one or more links 18. The server 14 can provide a 
web site having hypertext markup language (HTML) web pages to the terminals 16, using a 
protocol such as hypertext transfer protocol (HTTP). The terminals 16 can comprise personal 
computers (PCs) to access the server 14. The terminals 16 each have a display screen 20 that 
allows users to view information sent to and from the server 14. The terminal 16 can include 
an input/output unit 22, such as a keyboard and mouse. The terminal may also include a 
processor 24, and a storage unit 26, which can be any type of machine-readable storage 
medium such as a read-only memory (ROM), compact disks (CDs), digital video disks 
(DVDs), magnetic tape, floppy disk, etc. 

Other types of terminals 16 besides PCs may be used. For example, the 
terminals 16 can include workstations (e.g., dumb terminals) connected to an internal 
computer network, enhanced-functionality wireless devices having display screens, laptops, 
television monitors, etc. 

The link 18 and other links in the system 10 can include any type of high- 
speed data line(s) or network(s) that can accommodate high-speed bit rates, including Tl, 
xDSL, SONET, ATM, Ethernet, etc. Telephone modem links may also be used. The link 18 
can comprise hardwire links (e.g., twisted pair, optical fiber, coaxial, etc.) or wireless links 
(e.g., radio frequency, cellular, satellite, microwave, optical, etc.). A person skilled in the art 
will further appreciate that the speed of transmission of data via the link 18 may also vary 
from one system 10 to another, based on factors such as type of network or communication 
medium, level of network traffic, size of files being transmitted, etc. Therefore, 
embodiments of the invention are not limited by the specific type of terminals, networks, 
communication medium, data rate, etc. that are used by the system 10. 

The system 10 can include one or more database systems 28 to store many 
types and formats of data. For example, the database system 28 can store recorded 
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multimedia content for distribution over the Internet. The database system 28 can be part of 
the server 14 or it can be a separate network component communicatively coupled to the 
server 14. The database system can include one or more database units 30-36. For example, 
the database unit 30 can store audio files, such as .ra (Real Audio) or .wma (Windows Media) 
files. The database unit 32 can store images, such as photographs having .gif, .tif, JPEG, 
MPEG, etc. formats. The database unit 34 can store video files. The database unit 36 can 
store many types of miscellaneous content, such as executable files or Power Point files. The 
database units 30-36 can be coupled to each other and to the server 14, as shown 
symbolically by a link 38, thereby allowing communication/linking and exchange of data 
between these components. 

In one embodiment, any of the database units 30-36 can store metadata that 
point to files (e.g\, image files, media files, etc.) located on a distributed network of storage 
arrays (within the network 12 or the server 14), instead of or in addition to, such files located 
in the database system 28. Those skilled in the art will recognize that other methods of 
storage can store the files in any of the database units 30-36, in units/components separately 
located from the database system 28, in both locations, or in any other suitable 
location/manner. Furthermore, although the database units 30-36 as described store recorded 
content, live broadcasts may also be distributed by server 14. For live content, a real-time 
encoder 40 may be used to encode the content. The encoder 40 would receive as input live 
audio and video and encode it to various streaming multimedia formats. The encoder is 
coupled to the server 14, as shown symbolically by a link 41, thereby allowing 
communication/linking and exchange of data between these components. 

While only four database units 30-36 are shown in Figure 1, it is understood 
that there can be any number of database units. Further, it is possible to combine one or more 
database units into a single database unit. The database units 30-36 can comprise any type of 
computer-readable storage media having conventional database hardware configurations and 
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associated data structures, or they can comprise, more or less, computer-readable storage 
media that can store and organize information other than in a "database-type" format. 
Accordingly, embodiments of the invention are not limited by the specific number, types, or 
arrangement of the database units. The manner of operation of the database units 30-36 to 
provide a user with customized online photo albums will be described in further detail below. 

The system 10 can include a user authentication unit 44. The user 
authentication unit 44 stores user identifications (IDs), passwords, security codes, or other 
information required for granting the terminals 16 with access to the server 14. Each time 
any of the terminals 16 requests access to the server 14, that terminal sends a user ID, for 
example, to the server 14, which then grants access if the user ID matches user IDs of 
authorized users. 

The server 14 may be communicatively linked to an administration unit 42. 
The administration unit 42 can be, for example, a company that provides and maintains the 
services of the server 14, including but not limited to, coordination of downloads into the 
database unit 32, providing user accounts and user IDs for the terminals 16, providing 
customer service support, processing billing and account information, etc. 

Additionally, at least one monitor station 46 is provided in the network 12. 
The monitor station 46 periodically requests content from the server 14. Although only a 
single monitor station 46 is shown, in actual implementations, multiple monitor stations 46 
would be placed at various nodes or locations of the network 12. As will be seen in further 
detail below, the monitor station 46 is used to monitor the quality of the content distributed 
by the server 14 over the network 12. 

As delivery of multimedia content over large networks such as the Internet 
becomes more common, the operators of the server 14 need a method for assuring that the 
network has an acceptable impact on the quality of the content delivered to an end-user. 
Often, the operators of the server 14 are not the owners of the content, but merely contract 
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with the content owners to provide the service. Thus, the content owners expect that the 
server 14 will deliver the content satisfactorily. 

In accordance with one embodiment of the present invention, the content to be 
served has integrated therewith a digital watermark. Thus, in Figure 2, at box 201, a digital 
5 watermark is integrated into the content. Typically, for live event content, the integration can 
be done by encoder 40. The process of integration of a digital watermark into content is 
known in the art and will not be further discussed herein. For pre-recorded events, the 
integration of the digital watermark into the content is normally done well prior to 
distribution of the service. In such as case, the content (with the digital watermark) is stored 

10 in databases 30-36. 

Next, at box 203, the digital watermark content to be served is analyzed to 
obtain a reference point for a "pure" content signal that has not been served. This allows a 
measuring stick to be obtained for determining the amount of degradation to the content due 
to propagation through the network. The analysis is typically performed by hardware or 

15 software located at the server 14. In many cases, the server 14 does not directly serve the 

content to an end user. Instead, the server 14 may distribute the content to an "edge serving 

site" for redistribution to the eventual end user. Thus, the term server 14 as used herein may 

encompass a conventional server for serving content to the end user, or may also be what is 

commonly known as a "broadcast operations center" (BOC) that serves content to a second 

20 layer of edge serving sites for eventual distribution to end users. 

Next, at box 205, the content is distributed (or served) over the network. At 

various locations in the network, at box 207, the distributed content is received by at least 

one monitor station 46. The monitor stations 46 can be specifically designed and placed at 

strategic nodes in the network. For example, the monitor stations 46 may be placed at edge 

25 serving sites or at the end-user locales. In any event, at box 209, the purpose of the monitor 

stations 46 is to capture the distributed content at various locations in the network and to 
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perform an analysis of the digital watermark to determine the extent of any degradation. Any 
degradation to the digital watermark would most likely correlate well to the amount of 
degradation in the actual content. The analysis information done at the monitor stations 46 
can then be stored for later retrieval or be automatically sent to the server 14 using any 
conventional technique. 



content is compared to the information on the digital watermark of the pure content. The 
comparison can be done using any one of many conventional statistical measures. Based 
upon this comparison, the quality of delivered content can be assessed. The comparisons can 
be used to improve delivery systems. For example, if all of the analysis information from 
monitoring stations using a certain portion of the network indicates severe degradation, a 
troubleshooting effort may be useful for the hardware for that portion of the network. 



technique is important. Generally, the digital watermark selected should be sensitive to 
degradations of several characteristics. For example, spatial degradation of the image (e.g. 
speckle, noise, and tearing) should be indicated. Temporal degradation, such as dropped 
frames and missing packets, should be indicated. System resource usage, such as CPU and 
memory utilization, should be indicated. Further, the digital watermark should be easily 
inserted into the content without heavy computation or signal processing. Thus, the digital 
watermark should be generated on a variety of systems without significant CPU loading. 
Finally, the digital watermark should be amenable to statistical analysis, so that meaningful 
statistics can be easily generated. 



Next, at box 211, the information on the digital watermark of the distributed 



As can be appreciated, the selection of a particular digital watermarking 
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A digital watermark is often used to establish the ownership and origin of the 
copyrighted content, even when the content may have been modified (e.g., cropping, re- 
sizing, orientation). At some point during modification (e.g., posterization), the watermark 
may be lost. Another goal is that the watermark be used to measure quality. When 
5 watermarking is used to measure quality, information about the image quality is needed. One 
method is to use a watermark image, or a sequence of watermark images (a "watermark 
video"). By using a known watermark image or watermark video, a known quality-control 
image can be subliminally added into digital content. 

Once content is available in digital format (content might first be captured 
rt 10 using a analog system, such a Betamax video tape format) the quality-watermark is added, 
resulting in quality-watermarked digital content. In one embodiment, the watermark is added 
when the digital content is available in its highest quality format; for example, a 30 
3 frames/second, 512 x 488 digital NTSC video, or a 25 frames/second, 356 x 288 common 

ft intermediate format (CIF) video. 

J=f 15 The quality-watermark should consist of two at least components: i) a way to 

™ determine time intervals, and ii) an image that can be used to gather statistics about image 

quality. In accordance with the invention, one method is to use a framecounter watermark. 
The framecounter watermark should be robust to encoding and decoding artifacts, therefore, 
it should be added in several places in the original image. Specifically, the framecounter 
20 watermark may be a checkerboard that covers most of the image. 

A second method is to use a known sequence of images, much like the old 
movie headers that counted down the last seconds before the actual film started. In one 
embodiment, a simple gray-scale wipe that moves diagonally between the upper-left and 
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lower-right comers may be used. The resulting "quality- watermark video" makes it possible 
to get a sense of image quality (spatial quality) and timing (temporal quality). 

A watermark extraction process can be used to recover the quality-watermark 
video. In the ideal case, the recovered quality- watermark video can be matched up perfectly 
to the applied quality-watermark. However, degradations to the composite video 
(watermarked content) will affect the recovered quality-watermark. By analyzing the 
recovered quality-watermark spatial and temporal defects, it is possible to determine the 
quality of the composite video. 

Logging and/or monitoring the recovered quality-watermark at a remote site 
(at sites in the network, or even at end-users) allows the multimedia system to gather data 
about the quality of the delivered multimedia stream. 

However, since current multimedia distribution systems have constrained 
bandwidth between the digital content and the end-users, the digital content is first encoded 
into different spatial and temporal resolutions. The encoded format is sent over a network to 
the end-users, and then decoded. So the encode-decode process will add additional artifacts 
(quality defects) that cannot be ignored. The multimedia distributor will want to know how 
to measure these as well. 

Since the composite digital content is not available to the users, a new quality 
baseline needs to be established that comprehends the degradations due to the encode-decode 
process. The multimedia distributor needs to run the encode-decode process in an ideal 
environment - one free of the artifacts introduced by the distribution network (packet jitter, 
lost packets, block errors, etc.). The output of the ideal encode-decode is put through the 
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quality-watermark extraction process; this becomes the quality-baseline used to compare 
actual multimedia delivery. 

As current multimedia distribution systems require several encoding formats 
to better match the end-user connection characteristics, there needs to be a quality-baseline 
5 derived for each encoding format. The types of tests to be performed on the extracted 
quality-watermark are: 

• Correlation with the expected quality-watermark image 
(quality baseline(s)), both spatially and temporally, 

• RMS deviations in spatial amplitudes from quality baseline(s), 
10 • Frequency deviations from quality baseline(s), 

• Size, in pixels, of deviations from quality baseline(s), and 

• Duration, in milliseconds, of pixel deviations from quality 

baseline(s). 

The above description of illustrated embodiments of the invention, including 
1 5 what is described in the Abstract, is not intended to be exhaustive or to limit the invention to 
the precise forms disclosed. While specific embodiments of, and examples for, the invention 
are described herein for illustrative purposes, various equivalent modifications are possible 
within the scope of the invention, as those skilled in the relevant art will recognize. 

These modifications can be made to the invention in light of the above 
20 detailed description. The terms used in the following claims should not be construed to limit 
the invention to the specific embodiments disclosed in the specification and the claims. 
Rather, the scope of the invention is to be determined entirely by the following claims, which 
are to be construed in accordance with established doctrines of claim interpretation. 
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